Perceptual Metrics for Image Database Navigation
Authors
Abstract
I certify that I have read this dissertation and that in my opinion it is fully adequate, in scope and quality, as a dissertation for the degree of Doctor of Philosophy. Carlo Tomasi, Principal Adviser
Preface
The increasing amount of information available in today's world raises the need to retrieve relevant data efficiently. Unlike text-based retrieval, where keywords are successfully used to index into documents, content-based image retrieval poses up front the fundamental questions of how to extract useful image features and how to use them for intuitive retrieval. We present a novel approach to the problem of navigating through a collection of images for the purpose of image retrieval, which leads to a new paradigm for image database search. We summarize the appearance of images by distributions of color or texture features, and we define a metric between any two such distributions. This metric, which we call the "Earth Mover's Distance" (EMD), represents the least amount of work that is needed to rearrange the mass in one distribution in order to obtain the other. We show that the EMD matches perceptual dissimilarity better than other dissimilarity measures, and argue that it has many desirable properties for image retrieval. Using this metric, we employ Multi-Dimensional Scaling techniques to embed a group of images as points in a two- or three-dimensional Euclidean space so that their distances reflect image dissimilarities as well as possible. Such geometric embeddings exhibit the structure in the image set at hand, allowing the user to understand better the result of a database query and to refine the query in a perceptually intuitive way. By iterating this process, the user can quickly zoom in to the portion of the image space of interest. We also apply these techniques to other modalities such as mug-shot retrieval.
Acknowledgements
I would like to thank the many people who made my time at Stanford a period I will treasure. First and foremost, I would like to thank my advisor, Professor Carlo Tomasi, for guiding me …
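As a rough illustration of the Earth Mover's Distance described in the preface above, the following Python sketch computes the least work needed to turn one histogram into another in a deliberately simplified setting. It assumes one-dimensional histograms over the same unit-spaced bins, equal total mass, and a ground distance of |i - j| between bins i and j. The dissertation works with multi-dimensional color and texture distributions, which in general require solving a transportation problem, so this is only a special case, and the function name `emd_1d` is hypothetical.

```python
from itertools import accumulate


def emd_1d(p, q):
    """Minimal work to rearrange histogram p into histogram q.

    Assumptions (not taken from the source): both histograms share the
    same unit-spaced 1-D bins, have equal total mass, and the ground
    distance between bins i and j is |i - j|.
    """
    if len(p) != len(q):
        raise ValueError("histograms must have the same number of bins")
    if abs(sum(p) - sum(q)) > 1e-9:
        raise ValueError("this sketch requires equal total mass")

    # In one dimension, the mass that has to cross the boundary after
    # bin i equals the difference of the two cumulative sums at bin i.
    # Each unit of that mass travels one bin width, so the total work
    # is the sum of the absolute cumulative differences.
    work = 0.0
    for cum_p, cum_q in zip(accumulate(p), accumulate(q)):
        work += abs(cum_p - cum_q)
    return work


if __name__ == "__main__":
    # All mass sits in bin 0 and must be moved two bins to the right.
    print(emd_1d([1.0, 0.0, 0.0], [0.0, 0.0, 1.0]))
```

With unit total mass, as in the usage lines, the total work and the work per unit of mass coincide. Formulations that normalize by the amount of mass moved would divide the result by that amount, which is omitted here.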
Related resources
Visual Navigation in Perceptual Databases
In this paper we present our ideas on similarity-based image databases, and a new type of interface that we are developing for navigation in a database of images. The interface facilitates navigation in a display space whose geometric characteristics depend on the geometry of the perceptual space in which image similarity is measured. The display space is a subset of the three dimensional Euclid...
Perceptual quality assessment of 3D dynamic meshes: Subjective and objective studies
Nowadays, 3D mesh animations have been increasingly used in various applications, e.g., in digital entertainment and physically-based simulation. In many applications, it is possible that a surface animation undergoes some lossy operations which can impair its perceptual quality. Since the end users of mesh animations are often human beings, the perceptual quality assessment of 3D dynamic meshe...
Building structural similarity database for metric learning
We propose a new approach for constructing databases for training and testing similarity metrics for structurally lossless image compression. Our focus is on structural texture similarity (STSIM) metrics and the matched-texture compression (MTC) approach. We first discuss the metric requirements for structurally lossless compression, which differ from those of other applications such as image re...
Image authentication using LBP-based perceptual image hashing
Feature extraction is a main step in all perceptual image hashing schemes, in which robust features lead to better perceptual robustness. Simplicity, discriminative power, computational efficiency and robustness to illumination changes are regarded as distinguishing properties of Local Binary Pattern features. In this paper, we investigate the use of local binary patterns for percep...
A Qualitative Meta-analysis of Perceptual-motor Problems in Visually Impaired People
Introduction: Perceptual motor activities improve motor skills and learning. These skills play an effective role in receiving, interpreting and responding to the sensory stimuli. This study aimed to identify perceptual-motor problems in visually impaired people. Methods: This qualitative research was conducted using a research synthesis method. Therefore, the analysis unit consisted of all the...
Publication date: 1999